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gXPRgSSIQK ^ CTPQRT TECHHOLOCnf 
OF PROTglNS ^ IMMOKO FPSnrS 

This patent application is a continuation-in-part of U.S. Serial 
Number 08/305, 700 « filed on September 14 « 1994. 

5 Background 

The invention relates to fusion protein e3q)re88ion systems for use 
in mammalian cells that enhance the production of a given target 
protein. More specifically, the invention relates to a secretion 

10 cassette^ comprised of a mammalian signal peptide and a portion of 
mammalian immunoglobulins, which* when used as the amino^terminal 
fusion partner to the target protein, .generally leads to high level 
expression and secretion of the fusion product. Such fusion proteins 
are useful, for example, for the production and extracellular 

15 collection of target proteins without the need Sor lysis of a host 

cell. The invention is perhaps most useful for the eicpression of target 
proteins which are not normally secreted from a host cell, are secreted 
at low levels from a host cell, or are toxic or otherwise deleterious 
to a host cell • 

20 

Expression systems eiqploying gene fusion constructs have been used 
to enhance the 'production of proteins in bacteria. Bmploying a 
bacterial protein that is normally eaqpressed at a very high level as 
the amiho-texninal fusion partner of a fusion protein helps ^o ensiire 
25 efficient transcr^tion and translation of the message, and in some 

cases the secretion and solubilisation of the fusion protein (Smith and 
Johnson (1988) Gene 67s3Xi Hopp et al. (1988) Biotechnology 6tl204i La 
Vallie et al. (1993) Biotechnology 11:187). 

30 The major goal of expression of recombinant fusion proteins in 

mammalian cells has been to confer novel properties to the hybrid 
molecules, e.g., targeting of a cytokine or toxin in v?.vg# Fc receptor 
binding, coiqplement fixation, protein A binding, increasing the half- 
life, and crossing the blood-brain barrier. Examples of recomihinant 

35 fusion proteins produced in nanmaiian cells include cytokine 
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imraunoconjugates (Gillies et al. (1992) Proc. Natl. Acad, Sci, USA 
89:1428; Gillies et al . <1993) Bioconjugate Chemistry 4:230), 
immunoadhesins (Capon et al. (1989) Nature 337:525), immunotoxins 
(Chaudhary et al. (1989) Nature 339:394), and a nerve growth factor 
5 conjugate (Friden et al. (1993) Science 259:373). Each of the 

foregoing publications is incorporated herein by reference. Proteins 
produced in mammalian cells often do not have the solubility and 
secretion problems encountered in bacterial expression. The use of 
gene fusion constructs to enhance the production or secretion of a 
10 target protein in a mammalian system has not been explored fully. 

It is the object of the invention to provide DNAs which facilitate 
the production and secretion of a target protein. In particular, 
objects of the invention are to provide novel DNAs which: facilitate 

15 efficient production and secretion of hard to express proteins, such as 
nuclear proteins, regulatory ^jrote ins and proteins which otherwise may 
be toxic to a host cell, and can be adapted to any target polypeptide 
of interest which can be coded for and expressed in a host organism; to 
provide DNA constructs for the rapid and efficient production and 

20 secretion of proteins in a variety of host cells; and to provide a 
method for the production, secretion and collection of genetically 
engineered proteins, including non-native, biosynthetic, or otherwise 
artificial proteins « such as proteins i^ich have been created by 
rational design. Other objects of the invention are to provide DKA 

25 sequences which, when fused to a polynucleotide encoding a target 

protein, encode a fusion polypeptide whicdi can be purified using common 
reagents and techniques, and to interpose a proteolytic cleavage site 
between the encoded secretion cassette and the encoded target protein 
such that the secretion cassette can be cleaved from the target protein 

30 and the target protein can be purified independently. Still another 
object is to provide a procedure which is both efficient and 
inexpensive « 

These and other dbjects of the invention will be apparent from the 
35 description, drawings, and claims that follow. 



SUBSTITUTE SHEET (RULE 26) 



PCTAJS95/11720 



.3- 

fitimmarv Of The Invention 

The present invention features a DNA of general applicability for 
production and secretion of fusion proteins. The DNA comprises a 
5 secretion cassette, as the amino -terminal fusion partner, and a target 
protein, and is termed herein an •immunofusin- . The invention 
provides, in its various aspects, a recombinant DKA encoding the 
iramunofusin, and methods of producing the encoded immunofusin protein. 
The immunofusin is a DNA which conprises a polynucleotide encoding a 

10 secretion cassette, comprising in its 5* to 3" direction a signal 

sequence and an immunoglobulin Pc region, and a polynucleotide encoding 
a target protein fused to the 3* end of the secretion cassette. A 
secretion cassette of the invention, once constructed, can be fused to 
various target proteins. Additionally, one can optimize the sequences 

15 which regulate the expression of a secretion cassette « and hence the 
e3q>ression of the immunofusin. The resultant DNA can be expressed at 
hi^ levels in a host cell, and the fusion protein is efficiently 
produced and secreted from the host cell. The secreted immunofusin can 
be collected from the culture media without the need for lysis of the 

20 host cell, and can be assayed for activity or purified using common 
reagents as desired. 

Ttie portion of the WA encoding the signal sequence preferably 
encodes a peptide segment tdiich directs the secretion of the 

25 immunofusin protein and is thereafter cleaved. As used in the 
specification and claims « "immunoglobulin Pc region" means the 
caxboxyl -terminal portion of an inmunoglobulin heavy chain ponstant 
region. As is knom, eacdi immunoglobulin heavy thain constant region 
is coiqprised of four or five domains. The domains are named 

30 sequentially as follows: GKl-hinge*CR2-C3i3(-CH4} , and the Pc region of 
each immunoglobulin subclass lacks at least the CHi domain. As is 
apparent froih a review of the DNA sequences of the immunoglobulin 
subclasses, the WA sequences of the heavy chain domains have cross - 
homology among the immunoglobulin classes, e.g., the CH2 domain of IgG 

35 is Voraologous to the CH2 domain of IgA and IgD, and to the CH3 domain 
of IgM and igB. The portion of the DNA encoding the immunoglobulin Pc 
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region preferably comprises at least a portion of a hinge domain, and a 
CH3 domain of Fey or the homologous domains in any of IgA, IgD, IgE, or 
igM. The immunoglobulin Fc region also preferably comprises at least a 
portion of the DNA encoding a hinge and a CH3 domain of Fey or the 
5 homologous domains in any of IgA, IgD, IgE or IgM. 

The currently preferred secretion cassette is a polynucleotide 
encoding, in its 5* to 3* direction, the signal sequence of an 
immunoglobulin light chain gene and the Fcyl region of the human 

10 immunoglobulin yl gene- The Fcyl region of the immunoglobulin yl gene 
includes at least a portion of the hinge domain and CH3 domain, or at 
least a portion of the hinge domain, CH2 domain and CH3 domain. The 
DNA encoding the secretion cassette can be in its genomic configuration 
or its cDNA configuration. However, the studies described below use a ~ 

15 secretion cassette in the genomic configuration. The use of human Fcyl 
aa the Pc region sequence has several advantages. For exan^le, if the 
fusion protein is to be used as a biopharmaceutical, the Fcyl domain 
may confer the effector function activities to the fusion protein. The 
effector function activities include the biological activities such as 

20 con^jleraent fixation, antibody-directed cellular cytotoxicity, ability 
for placental transfer, and a longer serum half -life. Ttie Fc domain 
also provides for detection by anti-Pc BLISA and purification through 
binding to firaphvlococcus flurcus protein A ("Protein A») . In certain 
applications it may be desirable to delete specific effector functions . 

25 from the Pc region, such as Pc receptor binding or con^lement fixation. 

In another embodiment the Pc region can be a murine immunoglc^lin 
gene. The use of murine Pc as the Pc region can have advantages. For 
example, if the fusion protein is to be used for the preparation of 
30 proteins in mice, then the murine Fc region will not elicit an immune 
response in the host animal . The Fc domain may confer the effector 
function activities to the fusion protein, and allow for detection of 
the fusion protein by anti-Fc ELISA and purification through binding to 
Protein A. In certain applicatioxis it may be desirable to delete 
' 35 specific effector functions from the Pc region. 
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In another embodiment the DNA sequence encodes a proteolytic 
cleavage site interposed between the secretion cassette and. the target 
protein. A cleavage site provides for the proteolytic cleavage of the 
encoded fusion protein thus separating the Fc domain from the target 
5 protein. As used herein, "proteolytic cleavage site" is understood to 
mean the amino acid sequences which are cleaved by a proteolytic enzyme 
or other proteolytic cleavage agents. As will be described in more 
detail below, useful proteolytic cleavage sites include amino acids 
sequences which are recognized by proteolytic enzymes such as trypsin, 
10 plasmin or enterokinase K« 

In a preferred embodiment the target protein sequence encodes 
prostate specific membrane antigen. PSMA. PSMA is a type II membrane 
protein, thus the extracellular domain, or soluble form of the protein, 
15 is utilized as the target protein sequence. The encoded soluble form 
of PSMA can be a human sequence such as the sequence provided in 
Israeli et al. (1993) Cancer Res., S3:227-ff. 

In another preferred embodiment the target protein sequence encodes 
20 the protein gpl20- The envelope protein gpX20 qf human 

immunodeficiency virus is a glycoprotein which is expressed in infected 
cells as a polyprotein. gpl60, and then cleaved by a cellular protease 
to gpi20 and gp4i. The nucleotide sequence and amino acid sequence of 
gpl20 is provided in Ratner et al*, 1985, Nature, 313:277*ff. 

25 

In another aspect, the VSh sequence of the invention is integrated 
within a replicable expression vector. As used herein, •vector" is 
understood to mean ai^ nucleic acid comprising a nucleotide sequence of 
interest and competent to be incorixnrated into a host cell and to be 

30 recombined with and integrated into the host cell genome, or to 
replicate autononmisly as an episome. Such vectors include linear 
nucleic acids, plasmids, phagemids, cosmids and the like. A preferred 
expression vector is pdC, in which the transcription of the immunofusin 
DNA is placed under the control of the enhancer and promoter of the 

35 human cytomegaloviriis . The vector pdC was derived from pdEMp, which is 
described in liO et al* 1991, Biochim, Biophys. Acta 1088:712 (which 
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re- 
publication is incorporated herein by reference) as follows. The Sall- 
Xhol fragment containing the original enhancer and promoter sequence 
were replaced by the enhancer and promoter of the human cytomegalovirus 
by standard molecular biology techniques. The enhancer and promoter 
5 sequence of the human cytomegalovirus used was derived from nucleotides 
• 601 to +7 of the sequence provided in Boshart et al,, 1985, Cell 
41:521, which is incorporated herein by reference. The vector also 
contains the mutant dihydrofolate reductase gene as a selection marker 
(Simonsen and Levinson (1983) Proc. Nat. Acad. Sci. USA 80:2495, 
10 incorporated herein by reference) . 

An appropriate host cell can be transformed or transfected with the 
DNA sequence of the invention, and utilized for the expression and 
secretion of a target protein. Currently preferred host cells for use 

15 in the invention include immortal h^daridoma cells, myeloma cells, 293 
cells, Chinese hamster ovary cells, Hela cells, and COS cells. As used 
herein, "gene expression* or "expression of a target protein* is 
understood to refer to the transcription of the DNA sequence, 
translation of the mRNA transcript, and secretion of the fusion protein 

20 product . 

The method of the invention involves providing a DNA sequence 
encoding an immunofusin. transfecting the DNA sequence into a host cell* 
by an available transfection or transfomation technique, culturing the 
25 tiransfected host cell in a suitable media under conditions which 

promote the expression and secretion of the immunofusin, and collecting 
the fusion protein from the extracellular media. When desired* the 
target protein may be cleaved from the secretion cassette either before 
or after it is collected from the extracellular media. 

30 

Other advantages and features of the invention will be apparent 
from the description, drawings, and claims which follow. 
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Brief Description of the DrawinQ 

The Figures lA-D are a schematic illustration of an immunofusin. 
Figure lA, "DNA. " illustrates the DNA encoding an immunofusin protein. 
5 Figure IB, "Fused Protein 1," illustrates the immunofusin protein prior 
to cleavage of the signal sequence. Figure IC, "Fused Protein 2,- 
illustrates the immunofusin protein after cleavage of the signal 
sequence. Figure ID, "Target Protein," illustrates the target protein 
portion of an immunofusin protein after cleavage of the immunofusin 
10 protein at the cleavage site which is interposed between the Fc region 
and the target protein. 
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Petftj,led PesgrjiptApn 

The present invention is a DNA comprising a polynucleotide 
encoding, in the 5' to 3* direction, a signal sequence, an Fc region of 
5 an immunoglobulin, and a target protein. This approach to the 

expression and subsequent secretion of a target protein is superior to 
the existing techniques because of the choice and the configuration of 
the secretion cassette which is placed at the 5* end of the fusion 
construct. Additionally, the regulatory sequences which direct the 
10 e3q)ression of the secretion cassette can be optimized, and the 
optimized secretion cassette can be paired with numerous target 
proteins, thus allowing for the efficient production of numerous fusion 
proteins. 

15 The production of the immunofusin proteins is characterized as 

efficient and high level, because the target protein has been produced 
at the level of several micrograms/milliliter using the DMAs and 
methods according to the invention. Previously, workers in the art 
have rarely quanticated the esqiression levels of hard to exprass 

20 proteins due to the low levels of expression that are obtained in the 
known mammalian expression systems and the difficulties faced in 
quantitating proteins by techniques such as western blotting and RIA. 
. Prior to the teachings of this invention, expression of microgram per 
milliliter of hard to express proteins would often be attenqpted using 

2S bacterial esqpression systcsms. 

This invention is based on the concept that the ease of production 
and collection of a target protein could be inproved if the polypeptide 
of interest were linked to an immunoglobulin Fc domain and the fiision 

30 protein were expressed in a host cell, in particular a complementary 
host cell which naturally expresses the immunoglobulin, such that the. 
fusion protein «fould be readily secreted from the host cell. In 
addition to promoting the secretion of the fusion protein from the host 
cell, the PC region can further be exploited to aid. in the purification 

35 of the fused polypeptide. Hie general approach- of the invention 
involves the construction of recombinant DKA which encodes a fused 
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polypeptide, which upon expression, results in expression of a 
secretion cassette linked to a target protein, i.e., a protein of 
interest having potential or demonstrable utility. 

5 The overall structure of the preferred DNA of the invention, the 

fusion protein it encodes, the form of the protein which is most often 
secreted and the target protein product after enzymatic cleavage are 
illustrated schematically in Figures lA-D. Reference characters in the 
DNA, Figure lA, are carried over into the protein. Figures IB-D, as 

10 corresponding primed characters. The DNA which encodes the immunofusin 
is shown between the start and the stop markers on the illustrated DNA 
sequence. Figure lA. Upstream regulatory elements are shown at the 5* 
end of the DNA and are labeled * regulatory sequences*. The DNA is 
composed of three distinct polynucleotides which are linked together. 

IS In Figure lA, 3' of the regulatory sequences, which may be optimized 

for each secretion cassette, is a first DNA 8 which encodes a secretion 
cassette comprising two of the three polynucleotides: 1) a signal 
sequence 10, and 2) an immmoglobulin Pcf region 12. The 
imniunoglobulin Per region is con^rised of three subregions: 1) a hinge 

20 region 14, 2) a CH2 region 16, and a CH3 region 20. Attached to the 3' 
end of the DNA encoding the secretion cassette is the third 
polynucleotide, a DNA encoding the target protein 24. Optionally, DNA 
encoding a proteolytic cleavage site 22 can be interposed between the 
DNA encoding the CH3 region of the immunoglc^ulin Pcjr region and the 

2S DNA encoding the target protein. 

The encoded fused protein comprises the secretion cassette 8* and 
the target protein 24 shown as Fused Protein l in Pig. IB. Most 
often the signal peptide 10* will be enzymatically cleaved from the 

30 fusion protein by the host cell prior to the secretion of the 

immunofusin^ «nd thus Pused Protein 2, shown in Figure 1C« shows the 
secreted fused protein irtiich comprises the Fcr peptide 12' fused to the 
target polypeptide 24* . Both Fused Protein 1 and Fused Protein 2 show 
the c^tionhl interposition of a proteolytic cleavage site 22' between 

35 the CH3 domain 20* of the Pcy region 12* and the target protein 24 • . 

Cleavage of either Pused Protein with the appropriate proteolytic agent 
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at the cleavage site 22' results in the release of the target protein 
24* from the Fc region 12*, as shown in Figure ID. 

The processes for manipulating, amplifying and recombining DNAs are 
5 generally well known in the art. and therefore are not described in 
detail herein. Methods of identifying and isolating genes encoding 
proteins of interest, or for constructing such genes, are well 
understood and developed. In general the methods involve selecting 
genetic material coding for amino acids which define the polypeptide of 
10 interest according to the genetic code. 

Accordingly, the DMA construction principle disclosed herein can be 
exploited using known recombinant DNA techniques involving the use of 
various restriction enzymes which make sequence specific cuts in DNA to 

15 produce blunt ends or cohesive ends, DMA ligase techniques enabling 
enzymatic addition of sticky ends to blunt ended DNA, construction of 
synthetic DNAs by assembly of short oligonucleotides, cDNA synthesis 
techniques, polymerase chain reaction, and synthetic probes for 
isolating genes having a particular function. Various promoter 

20 sequences and other regulatory DNA sequences used in achieving 
expression, and various types of host cells are also known and 
available. Conventional transfection techniques, and equally 
conventional techniques for cloning and subcloning DNA ere useful in 
the practice of this invention and known to those skilled in the art, 

25 various types of vectors may be used such as plasmids and viruses 

including animal viruses. The vectors toy cjq)loit various marker genes 
which Impart to a successfully transfected cell a detectable phenotyplc 
property that can be used to identify which of a family of cells has 
successfully incorporated the recontolnant DNA of the vector. Given the 

30 foregoing state of the genetic engineering art, skilled persons are 
enabled to practice the Invention disclosed herein In view of this 
disclosure. 

One method for obtaining the DNA encoding the various synthetic 
35 linkers disclosed herein is by assembly of synthetic oligonucleotides 
in a conventional, automated, polynucleotide synthesizer followed by 
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ligation with a ligase. For example, the linkers can be synthesized as 
con^lementary DNA fragments using phosphoramidite chemistry. 

The signal sequence of the invention is a polynucleotide which 
5 encodes an amino acid sequence that initiates transport of a protein 
across the membrane of the endoplasmic reticulum. Signal sequences 
which will be useful in the invention include antibody light chain 
signal sequences, e,g., antibody 14,18 (Gillies et. al., 1989, Jour, of 
Immunol. Meth,, 125:191-202), antibody heavy chain signal sequences, 

10 e.g., the M0PC141 antibody heavy chain signal sequence (Sakano et alw 
1980, Nature 286:5774), and any other signal sequences which are known 
in the art (see for exan5)le, Watson, 1984, Nucleic Acids Research 
12:5145). Each of these references is incorporated herein by 
reference. Signal sequences have been well characterized in the art 

15 and are known typically to contain 16 to 30 amino acid residues, and 
may contain greater or fewer amino acid residues. A typical signal 
peptide consists of three regions: a basic N- terminal region, a central 
hydrophobic region, and a more pdlar C- terminal region. The central 
hydrophobic region contains 4 to 12 hydrophobic residues that anchor 

20 the signal peptide across the membrane lipid bilayer during transport 
of the nascent polypeptide. Following initiation, the signal peptide 
is usually cleaved within the lumen of the endoplasmic reticulum by 
cellular enzymes known as signal peptidases. Potential cleavage sites 
of the signal peptide generally follow the "(-a, -1) rule*. Thus a 

25 typical signal peptide has small, neutral amino acid residues in 
positions -1 and -3 and lacks proline residues in this region. The 
sigxial peptidase will cleave such a signal peptide between the -I and 
♦1 amino acids. Thus, the portiort of the DNA encoding the signal 
sequence may be cleaved from the amino-terminus of the immunofusin 

30 protein during secretion. This results in the secretion of a 

immunofusin protein conisisting of the Pc region and the target protein. 
A detailed discussion of signal peptide sequences is provided by von 
Heijne (19861 Rucleic Acids Jles-. 14:4683 (incorporated herein by 
reference) . As would be apparent to one of skill in the art, the 

35 suitability of a particular signal sequence for use in the secretion 
cassette may require some routine experimentation. Such 
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experimentation will include determining the ability of the signal 
sequence to direct the secretion of an inununofusin and also a 
determination of the optimal configuration, genomic or cDNA, of the 
sequence to be used in order to achieve efficient secretion of 
5 iramunofusins. Additionally, one skilled in the art is capable of 
creating a synthetic signal peptide following the rules presented by 
von Heijne, referenced above, and testing for the efficacy of such a 
synthetic signal sequence by routine experimentation. A signal 
sequence is also referred to as a -signal peptide-, -leader sequence* 
10 or "leader peptides" and each of these terms having meanings synonymous 
to signal sequence may be used herein . 

The Fc region of an immunoglobulin is the amino acid sequence for 
the carboxyl -terminal portion of an immunoglobulin heavy chain constant 

15 region. The Fc regions are particularly in5)ortant in determining the 
biological functions of the immunoglobulin and these biological ^ 
functions are termed effector functions. As known, the heavy chains of 
the immunoglobulin subclasses comprise four or five domains: IgM and 
IgE have ^ive heavy chain domains « and IgA, IgD and IgG have tour heavy 

20 chain domains. The Fc region of IgA, IgD and IgG is a dimer of the 
hinge-CH2-CH3 domains, and in IgM and IgE it is a dimer of the hinge- 
CH2-CH3-CH4 domains. Further the CH3 domain of IgM and IgE is 
structurally equivalent to the CH2 domain of IgG, and the CH4 domain ot 
IgM and IgE is the hmolog of the CHB domain of IgG <8ee, if.E«PauI, 

25 ed., 1993, Fundamental Immunology, Raven Press. New York, New York, 
which publication is incorporated herein by reference) . Any of the 
kxiown Fc regions would be useful as the Fc region of the eecretlon 
cassette. Howver, it is important that the binding sites for certain 
proteins be deleted from the Fc region during the construction of the 

30 secretion cassette. For example, since coexpression with the light 
chain is unnecessary, the binding site for the heavy chain binding 
protein, Bip (Hendershot et al. (1987) y^WTO?.. TPd^Y &:lll'll^) . 
should be deleted from the CH2 domain of the Fc region of IgE, such 
that this site does not interfere with the efficient secrcition of the 

.35 immunofusiti. Likewise, the cysteine residues present in the Fc regions 
which are responsible for binding to the light chain of the 
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iininunoglobulin should be deleted or substituted with another amino 
acid, such that these cysteine residues do not interfere with the 
proper folding of the Fc region when it is produced as an immunofusin. 
In the same manner, transmembrane domain sequences, such as those 
5 present in IgM. should be deleted such that these sequences do not 
result in misdirecting the immunofusin to the membrane as a 
transmembrane protein. 

Upon expression and production of the Fc region as a portion of the 

10 secretion cassette, it may retain some of the biological properties, 
termed "effector functions", which are native to the particular 
immunoglobulin class from which the Fc region is obtained. Useful 
effector functions include, for example, coii?)lement fixation, Fc 
receptor binding, binding to cell membranes, and placental transfer, . 

15 m some cases, it may be advantageous to modify or remove one or more 
of these effector functions, such as Fc receptor binding or conplement 
fixation, using site directed mutagenesis or other well known molecular 
biology techniq[ue8, Fdr example* Duncan et al. (Nature, 1988, 332:738) 
have mapped the amino acids responsible for the several of the 

20 immunoglobulin gamma effector functions activities, see also, Duncan 
et al,, 1988, 332:563; Yasmeen et al,, Immunol., 1976, 116:518; Tao et 
al., J. Immunol,, 1989, 143:2595, Each of these publications is 
incorporated herein by reference • The amino acids or peptide segments 
responsible for these functions can be deleted thus removing that 

25 portion of the Fc region, or substituted with sequences which would not 
confer the function using well known molecular biology techniques. 

The currently preferred class of immunoglobulin from which the Pc 
region is derived is immunoglobulin gamma-1, because it has been well 

30 ^racterized and is efficiently secreted from most cell types . The Pc 
region of the other subclasses of iminunogiobulin ganma (gamma-2, gamma* 
3 and 9afflma*4) would function equally well in the secretion cassette. 
The Pc region of immunoglcAmlin gamma- 1 is preferably used in the 
secretion cassette includes at least part of the hinge region, CK2 

35 region, and CH3 region. In addition, the Fc region of immunoglobulin 
gamma-l can be a CK2*deleted-Pc. which ixicludes a part of a hinge 
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region and a CH3 region wherein the CH2 region has been deleted. A 
CH2 -deleted- Fc has been described by Gillies et al,, 1990, Hum. 
Antibod. Hybridomas. 1:47, which publication is incorporated herein by 
reference- 

5 

As is apparent from the above discussion of Fc regions, the Fc 
regions from the other classes of immunoglobulins, IgA, IgD, IgE, and 
IgM, would also be useful as the Fc region of the secretion cassette. 
Further, deletion constructs of these Fc regions, in which one or more 
10 of the constant domains are deleted would also be useful. One of 

ordinary skill in the art could prepare such deletion constructs using 
well known molecular biology techniques. 

The identity of the target protein produced in accondance with the* 

15 invention is essentially unlimited. Indeed* an iinportant feature of 
the invention is that it provides a generalized DNA construct, and 
procedure which can be adapted to facilitate recombinant production of 
any desired target protein. For instance « the application of the 
invention to the e3q)ression of the regulatory proteins, such as 

20 transcription factors which are normally localized to the nucleus, 
allows for the efficient secretion of such normally non-secreted 
proteins. In addition, regulatory proteins are in general diffiduTt to 
express and the purification procedures are generally cumbersome 
for example, Me'isteremst et al. (1951) Cell 66:981) . Therefore, it is 

25 especially desirable that such proteins be exported into the culture 
medium. Additionally^ the invention can be used to enhance the 
production and secretion of proteins lAiich are normally secreted at low 
levels. If a desired target protein includes sequences encoding a 
secretion signal or a transmembrane signal, these sequences can be 

30 removed from the target protein such that the secretion cassette 
directs the secretion of the fusion protein. 

The optional proteolytic cleavage site may be any amino acid 
sequence which is recognized by specific cleavage agents « The 
35 specificity of cleavage agents is determined by the identity of the 
sequence of amino acids at . or near the peptide bond which is to be 
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hydrolyzed. A given cleavage agent may recognize the bond between two 
specific amino acids or may recognize a bond following one or a 
specific sequence of amino acids. The specificity of many cleavage 
agents is known. Table l set forth below lists various known cleavage 
5 agents and their primary (and in some cases secondary) sites of action. 



Cleavaoe Aoent 

10 Trypsin 

Chymotrypsin 
Elastase 



15 



Pepsin 
Papain 
Subtilisin 

Tlxermolysin 



20 S ■■, wrevs protease 
Endoproteinase 
Arg C (Submaxillaris 
protease) 
Clostripain 

25 llizombin 
Collagenase 



ZABLE-J. 
Maior SitP of Action 

Arg, Lys 
Trp. Phe, Tyr 
Neutral Aliphatic 
Residues 
Phe, Leu, Trp 
Arg, Lys, Gly 
Aromatic and . 
Aliphatic residues 
Amino- linked bonds 
of Aliphatic Residues 
Glu 
Arg 



Arg 
Arg 

X-GIy-Pro 



Xiysbbacter 
. enzymogenes 
30 (endoproteinase Lys-C) 
Mysobacter Al-1 Lys 
Protease 

Armillaria mellca I*ys . 
Flavobacterium Pro 
35 meringbsepticum « 

Factor Xa lle-Glu-Gly-Arg 



Action 

Leu, Met, His 



Ma/ Gly, Glu 
Hide specificity 
Various 



Ala, Phe 



Asp 



X-Ala-Pro 
X-Gly-lto 
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CNBr Met 

BNPS-sJcacole . Trp 

N-bromosuccinimide Trp 

0-iodosobenzoic Trp 
5 acid 

HBr/DMSO Trp 

NTCB Cys 
Sodium metal in 

liquid ammonia Pro 

10 Hydroxy lamine Asn-Gly 

Dilute acid Asp-Pro 

Other cleavage agents are known. Those preferred for use in the 
invention are enzymes with a primary site of action which cleave at the 
IS C- terminal side of the cleavage site residue. 

The cleavage site in the fused protein generally can comprise any 
one or sequence of amino acids which can be cleaved by a cleavage agent 
specific for the site in an appropriate environment* Specificity of 

20 cleavage can be increased, and likelihood of undesired cleavage within 
the target protein or elsewhere in the fused polypeptide can be 
decreased^ by selecting the cleavage agent having a site of action 
which is absent from the target polypeptide. The fused polypeptide is 
preferably cleaved under conditions in which it has assumed its native 

25 conformation. This has the effect of masking the presence of potential 
cleavage sites in the target polypeptide. . 

The invention is illustrated further by the following non- limiting 
examples . 



30 



Example l. Construction of a Secretion Cassette 



The construction of an exemplary secretion cassette is described 
below. As would be appreciated by those of ordinary skill in the art, 
35 the signal sequence, and the Fc region of an immunoglobulin could be 
other sequences than those described. 
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The signal sequence of an immunoglobulin light chain of the 14.18 
antibody was selected for use as the signal sequence of the secretion 
cassette. The sequence of the 14.18 antibody light chain is provided 
5 in Gillies et al., 1989, Jour. Immunol. Meth., 125:191-202 and is 

incorporated herein by reference. The signal sequence was modified for 
ease of cloning as an Xbal-Af III fragment of the DMA. As would be 
apparent to those of skill in the art, the DNA encoding a human signal 
sequence could also be used. Specifically, an Xbal site was introduced 

10 of the translation initiation codon and the consensus sequence for 
optimal ribosome binding (Kozak, 1984, Mature 308:241. incorporated 
herein by reference) • An Aflll site was introduced into the 3* end of 
the signal sequence by ttoitagenizing the DNA coding for the penultimate 
amino acid residue of the signal peptide from a serine to a leucine, 

15 thus the sequence ATC was mutagenized to TTA using site directed 
mutagenesis. 

The Fc region of an immunoglobulin was selected to be the human 
Pcyl genomic DNA, including the genomic configuration of the hinge, CH2 

20 and CH3 domains. The genomic sequence of human Fcyl is provided in 
Huck et al., C1986) Nucleic Acids Res. 14:X779 and is incorporated 
herein by reference. As would be apparent to one of ordinary skill in 
the art . a CH2-deleted-Pc may also be used as the Fc region of the 
secretion cassette (see. Gillies et al.« 1990, Rum. Antibod. 

25 Hybridbmas, ls47) , in which case the CH2 domain would be deleted from 
the Pc region using established molecular biology techniques during the 
construction of the secretion cassette. The genomic DNA of Pcyl was 
modified for ease of cloning as an Aflll-Xmal fragment. The 5' end of 
the human Pc genomic DNA was mutagenised to an Aflll site by performing 

30 a Polymerase Chain Reaction (PCR) using a 5* sense primer with the 
following sequence (Sequence ID No. l) : ' 

(UtfSAAT TCTTAAGO Ga^SCCCAAATC^ 

This primer introduced an Aflll site (underlined) and a cysteine to 
serine mutation (TGT to TCT. bold) . The cysteine being mutated is the 
35 Tone that is normally involved in disulphide bonding with the light 

chain and. thus does not affect the effector functions of the Pc region. 
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The deletion of this cysteine may serve to enhance the production of 
the Fcyl region as the efficient production of this modified Fcyl 
region will not require the coexpression of the immunoglobulin light 
chain. This cysteine was also removed such that it does not interfere 
5 with the proper folding of the Fcyl region or the fused target protein. 
The 3' end of the Fcyl genomic DKA encodes for two Xmal restriction 
sites. They are located at 10 and 280 bp upstream of the translation 
stop codon in the CH3 domain. The distal Xmal site was destroyed by 
introducing a silent mutation, using site directed mutagenesis, (TCC to 
10 TCA, where the CC were the first two bases of the Xmal site) so that 
the Xmal site 10 bp upstream of the stop codon became unique. 

The Xbal-Aflll restriction fragment encoding the light chain signal 
peptide was then ligated to the Af III -Xmal restriction fragment 
15 encoding the Fc region. The resultant Xbal-Xmal restriction fragment 
therefore encodes the secretion cassette, and the gene encoding the 
target protein of interest can be ligated to the 3' end of the 
secretion cassette via the Xmal site. 

20 In general, the DNA encoding the target protein can be ligated to 

the unique Xmal site through the use of a linker -adaptor, such a 
linker-adaptor may also include restriction endonuclease sites in 
addition to an Xmal site . The use of a linker -adaptor has the 
additional feature in chat It can encode a proteolytic cleavage site 

25 for subsequent use in cleaving the target protein from the secretion 
cassette after production and secretion of the fusion protein. For 
BXBXuple, the linker-adaptor can encode a lysine residue at the junction 
of the fusion protein* which provides the option of cleaving the target 
protein from the Fc domain by proteolytic ensymes such as trypsin or 

30 plasmin. Similarly, the linker adaptor can include a DMA encoding the 
cleavage site of enterokinase K (Asp-Asp-Asp-Asp-Lys) in order to 
provide for the specific cleavage of the secreted fusion protein by 
enterokinase K« 

35 Example 2. Construction of an Immunofusin 
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The construction of an exemplary immunofusin, including a secretion 
cassette and a target protein is described below. As would be apparent 
to those of ordinary skill in the art, other target proteins can be 
fused to a secretion cassette using the same or other molecular cloning 
S techniques . 

The target protein for the exemplary immunofusin was chosen to be 
CD26, which is a type II membrane protein having its active site within 
the carboxyl- terminal region of the protein which is the extracellular 

10 domain. During the construction of a CD2€ immunofusin, the cytoplasmic 
and transmembrane domains of CD26 were deleted so that they would not 
interfere with the secretion of the immunofusin by the secretion 
cassette. The 5* end of the cDNA encoding the extracellular domain was 
modified for ease of cloning to include a Xmal site, which was 

IS introduced via a linker-adaptor. The 3' end of the C026 cDNA was also 
modified for ease of cloning to include a Xhol site, which could be 
introduced do%nistream of the translation stop codon either hy PGR or by 
linker-adfiptor ligation. 

20 Various linker -adaptors can be used depending upon the desire for 

introduction of a proteolytic cleavage site between the DNA encoding 
for the Fc region and the CD26 cDKA. For exan5)le, one linker-adaptor 
which can be used for CD26 is.: 

5* COO GGT (AAA) GGC ACA OAT GAT OCT ACA G 

25 3* CA (TIT) TIG TGT CIA CTA 06A TGT C 

as provided in Sequence ID Hos. 2 and 3. Ibe first three codons in. the 
top strand encode the last three amino acid residues of the CH3 domain, 
and starting vith the codon GGC is the gene sequence of the 
extracellular domain of CD26. This linker-adaptor had the cohesive end 

30 of an Xmal site at its 5* end and the blunt end of a PvuII site at its 
3* end, the blunt ended PvuII site being a convenient site for 
reconstruction with the rest of the CD26 cDKA. The lysine codon (AAA, 
in parenthesis) in the linker-adaptor is but one of many optional amino 
acid sequences which are tiseful to provide for a proteolytic cleavage 

35 site by cleavage agents. For example, this lysine residue can be 
cleaved by enzymes such as trypsin or plasmin. 
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Alternatively, for more specific proteolytic cleavage by 
enterokinase K, the gene sequence encoding the enterokinase K cleavage 
site can be introduced via the following 1 inker -adaptor : 
5 5* CCG GGT TCA GGG GAT GAC GAT GAC GAT A 

3' CA AGT CCC CTA CTG CTA CTG CTA TTC GA 

as provided in Seg. ID Nos. 4 and 5. The nucleotides in bold encode 
the amino acid residues (Asp)4-Lys, which is the recognition site of 
enterokinase K. The linker-adaptor ends with a Hindlll site, to which 
10 the CD26 gene or other target protein gene sequences can be joined. 

Example 3. Host Cells and Transfection 

The preferred host cell lines include the mouse myeloma Cor 
15 hybridoma) NS/0 and Sp2/0 Agl4 cells. The myeloma cells were 

transfected by protoplast fusion and selected in Dulbecco's modified 
Eagle* s medium (Gibco) containing 10% fetal bovine serum and 100 nM 
methotrexate, as described by Gillies et al., 1989, BioTechnologi'. 
7:799, which publication is incorporated herein by reference. 
20 Transfectants secreting the immunofusins were identified by anti-Fc 
ELISA, as described by Gillies et al. (1969) J. Ximmmol. Methods 
125:191, which publication is incorporated herein by reference. The 
highest producers were adapted to media containing 1 MM MIX and 
subcloned by limiting dilutions. For the production of immunofusins, 
25 the cells were grown in Hybridoma Serum-Free Media (HSFM, Gibco) 
containing It fetal bovine seriun and 1 }iM HTX. 

The other preferred recipient cell line is the human kidney 293 
cells, which is useful for both transient and stable expression. Other 

30 cells, such as the HeLa and the Chinese hamster ovary (CHO) cells, also 
worked in our system. The preferred method of transfection for these 
adherent cells is by coprecipitation of plasmid OKA with calcium 
phosphate, and other methods include lipof action and eiectroporation. 
For a description of these methods and other useful transfection 

35 methods see, Santoook et al. (1989) Molecular Clanino-^A Labdratorv 
Manual , Cold Spring Harbor, OT, incorporated herein by reference. 
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Example 4. rhararrRriaarion and Pnrif irflrion 
Immunofuains 



5 For routine characterization by gel electrophoresis, immunofusins 

in the conditioned media were first captured on Protein A Sepharose 
<Repligen. Cambridge, MA) and then eluted by boiling in protein sample 
buffer with or without 2-mercaptoethanol. After electrophoresis on an 
SDS-gel. the protein bands were visualized by Coomassie staining. For 

10 example, the IL2 immunofusin. see example 5. gave a band having the 
molecular weight of 45 JcD under reducing conditions and a band having 
the molecular weight of 90 kD under non-reducing conditions, showing 
that the IL2 immunofusin was produced as a diroer. presumably through 
disulphide bonding in the hinge domain of the Fc region. 

For purification, the cell culture media was collected and then the 
immunofusins were bound on Protein A Sepharose. The immunofusins were 
subsequently eluted trtm the Protein A in a sodium citrate buffer (100 
BM. pH 4) . The eluate was then immediately neutraliaed with O.l volume 
of 1 M Tris-hydrochloride. pH 8. In the case of CD26 immunofusin. it 
was shown that such an elution procedure resulted in greater than 80t 
recovery of the CD26 immunofusin with no loss of enzyme activity 



Bxan^le 5. p«prgfision nf II.2 Tmmunofusin 



The cDHA of mature IL2 protein was modified for ease of cloning to 
have a S' Xmai restriction endonucleaae site and a 3' Xhol .restriction 
endonucleasc site using well known molecular techni<|ues. such as those 
Which were as described In exasiple 2. The sequence of the mature II.2 

30 cOHA is provided in Taniguchi et al.. 19«3, Bature. 302t30S and is 
incorporated herein fcv reference. The cdHa of the mature 112 protein 
was constructed using recontolnant techniques as a synthetic gene in 
order to optimise eodon tkssge and to Introduce desirable restriction 
endoftucle'ase cleavage sites. Ttie synthetic gene was created using 

35 conventional DHA manipulation techniques. Once the synthetic IL2 cDMA 
was constructed, the 5' JCmal site of the IL2 cTOIA was ligated to the 3- 
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Xmal site of the secretion cassette, described in Example 1. The IL2 
immunofusin, was then cloned into the expression vector pdC. The IL2 
immunofusin expression vector was transfected into NS/0 and Sp2/0 as 
host cells by protoplast fusion, as is described by Gillies et al . , 
5 1989, Biotechnology, 7:799. 

Two to three weeks after transfection, MTX-resistant NS/0 and Sp2/0 
clones appeared. The initial clones were screened by anti-Fc ELISA. 
The IL2 immunofusin protein was collected from the media. An 

10 appropriate assay for the biological activity of IIj2 was the standard 
T-cell proliferation assays according to Gillies et al. (Proc. Natl. 
Acad. Sci. (1992) 89:1426), which is incorporated herein by reference. 
The spent culture of the best clone contained about lOO »ig/ml of IL2 
immunofusin. The host cell clones which efficiently produced and 

15 secreted the IL2 immunofusin protein were subcloned in media containing 
100 nM MTX, and the best subclone produced about 200 iig/ml of protein 
in spent culture- When MIX was left out of the media in the 
subcloning, the best subclone thus isolated produced about 180 Hg/ml in 
spent culture. Thus, the construction of an IL2 immunofusin 

20 unexpectedly provided for the production of IIi2 at a level which is 
about 80 times chat «;hich can be achieved by the ea^ression of IL2 
alone using the pdE^^ vector (unpublished data) , and many times o£ that 
of the IL2 that was expressed in mammalian cells (Conradt et al., 1989,^ 
J. Biol« Chem.« 264:17368) and in yeast (£nist et al.« 1989, 

25 Biotechnology, 7:716). As mentioned in escomple 4, IL-2 immunofusin was 
produced as a homo-dimer of molecular weight of 90 kO, presumably 
j through disulphide bonding in the hinge domain of the 45 kD monomers. 

Exaoisle 6. Emression of CD26 immunofusin 

30 

The construction of CD26 as an immunofusin was undertaken to 
demonstrate that the invention is applicable to the expression of 
membrane anchored proteins such as type II. membrane proteins. A type 
II metcbrane protein displays the carboxyl* terminal domain on the 
35 extracellular surface, and most often includes its active region within 
this cart)Oxyl -terminal domain. The joining of a fusion polypeptide to 
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the carboxyl -terminal region of such a protein may interfere with the 
proper folding of the active site, and thus reduce or prevent the 
production of active protein. 

5 CD26 is a type II membrane protein comprising 766 amino acid 

residues. The biological fimction of CD26 is as a T cell activation 
antigen and the putative coreceptor for entry of HIV in CD4+ cells 
(Callebaut et al. <1993) Science 262:2045). The CD26 protein is 
anchored to the lipid bilayer of the plasma membrane through a 

10 hydrophobic domain between residues 7 and 28 at the M-terminus. Amino 
acids 1 to 6 form a short cytoplasmic tail- The rest of the protein* 
between res'idues 29 and 766, is extracellular and includes several 
potential N-glycosylation sites and the active site of the enzyme 
(Tanaka et al. (1992) J. Immunol. 149:481). The 728 carboxyl- terminal 

15 residues in CD26 protrude from the membrane surface and the C-terminus 
is free. A soluble CD26 expressed as an immunoadhesin* will have a 
conformation different from that of the native CD26, because the 
carboocyl -terminus in an immunoadhesin CD26 protein is not free but 
connected to antibody sequence. On the other hand« if we engineer an 

20 immunofusin in which the antibody sequence is amino -terminal to the 
target protein, such as CD26, the native conformation of CD26 will be 
preserved, i.e. the C-terminus is free, and the antibody sequence, 
herein an Pc region, takes the place of the membrane to which CD26 is 
normally anchored. The enzymatic and biological activities of such a 

25 soluble CD26 isminofusin will not be compromised. In addition* CD26 is 
a protease and its eaq^ression may be deleterious to the host cell. 
Thus by efficiently eaqiorting the CD26 protease outside of the host 
cell in the form of an immunofusin* a higher level of expression can be 
achieved. 

30 

A 2.3 kb cOKA fragment encoding the extracellular domain of C026 
was used to construct the CD26 imnaunofusin expression vector. The IMiA 
sequence of CD26 is provided in Tanaka et al., 1992, J. Immunol., 
149:481 and is incorporated herein by reference. CD26 was fused 3* of 
35 the secretion cassette as described above in example 2, and then the 
secretion cassette and CD26 target protein were cloned into the 
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expression vector pdC using the Xbal restriction endonuclease site 5' 
of the light chain signal sequence and the Xhol restriction 
endonuclease site 3' of the CD26 protein as described in exan^jle 2 
above. The resultant CD26 immunofusin expression vector was 
5 transfected into a host cell as described in Exanple 3 above. MTX- 

resistant clones from transfected NS/0 and Sp2/0 cells were screened by. 
anti-Pc ELISA and DPPIV activity assay. CD26 is also known as DPPIV, 
which is an exopeptidase that cleaves after amino- terminal X-P {X can 
be any amino acid residue « and P is proline) . DPPIV enzyme activity of 

10 the CD26 immunofusin was assayed according to Tanaka et al,, Proc. 
Natl. Acad. Sci., 1993, 90:4586, incorporated herein by reference, 
using glycylproline p^nitroanilide tosylate (Gly-Pro-pNA) as a 
substrate. The best NS/0 clone produced about 3.5 \ig/ml of CD26 
immunofusin. The DPPIV moiety of the protein product was determined to 

15 be fully active, having and k^at values similar to those of the 

native CD26. Furthermore the enzymatic activity of CD26 immunofusin 
was inhibited by known peptide inhibitors in a dose -dependent manner. 
The peptide inhibitors tested included Che cripeptides IPI and VPL and 
APL, each of which inhibited the CD26 enzyme activity greater Chan 30% 

20 at 0.15 mM, greater than 70% at 1 wM and greater than 90% at 4 mM. As 
a control known non- inhibitor peptides were also tested for their 
effect upon CD26 enzyme activity and the known non- inhibitors, GGG and 
GPHyP (wherein HyP is hydro^qproline) « %#ere found to have no effect on 
Che CD26 accivity whim incubated wich the CD26 immunofusin at 

25 concencraclons ranging becween O.OI mM and 11 mM. 

Example 7. Bimreg8l.gn pg Tftt tmunQtygin 

The invencion was also applied co Che eaq>ression of regulacory 
30 proceins which are normally localized Co che nucleus. Because 

regulacory proteins are in general difficult to esqpress and purify« ic 
is especially desirable to devise a method by «^ich such proteins can 
be efficiently secreted from a host cell. Imiminofusin constructs of 
Tat and Rev (described in example 8) ; which are two proteins encoded by 
35 Che human immunodeficiency virus (HIV) that regulate expression of 

viral proceins in Che cell nucleus, were made in order to determine che 
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efficiency with which chese proteins can be expressed and collected. 
We obtained high level expression and secretion of the Tat and Rev 
immunofusins, and readily purified the immunofusins in a single step. 
A 260 base-pair cDNA fragment encoding Tat was cloned into the Xraal 
5 and Xhol sites of the pdC expression vector by modification of the 5* 
and 3* ends of the Tat protein using recombinant DNA techniques as 
described above. The sequence of the cOMA encoding the Tat protein is 
provided in Ratner et al., 1985. Nature, 313:277, and is incorporated 
herein by reference. Specif ically« the sequence at the 5* end was 

10 modified to* Seq. ID No. S, C CCG GG T CGC ATO GAG . . . . , where the 
underlined sequence is the Xmal site and the AT6 in bold is the 
translation start codon of the Tat gene. At the 3* end, an Xhol site 
was introduced inmiediately domstream of the translation stop codon by - 
standard PGR techniques. The Tat iromunofusin expression vector was 

IS then transfected into a host cell, as described above, and the host 
cells were analyzed for production of Tat itnmunofusin protein. High 
level expression was obtained in transiently transfected 293 cells and 
stably transfected NS/0 cells. Stable NS/0 clones produced about 3 
fig/ml of a 48 kD protein, analyzed on a SDS-gel under reducing 
20 conditions. This protein was confirmed to be Tat immunofusin by an 
anti-Tat antibody (Cat. #7001, American BioTechnologies, CaiAbridge, 
MA). 

The Tat iiraminofusin was shown to be active by the following 
25 transient eaqpression experiment in 293 cells, the results of which are 
presented below in Table 2« The expression vector for Tat immunofusin 
was cot ransf acted with a separate vector containing LTR-TAR-Kappa« 
where LTR-TAR is the long terminal repeat DKA sequence of HIV that is 
transactivated by the Tat protein, and Kappa is the gene sequence 
30 encoding the Kappa light chain of immunoglobulin. To measure 
expression levels of Fc-Tat (Tat immunofusin) and Kappa, the 
supematants were assayed by anti-Pc and anti-Kappa ELISA respectively. 
In Table 2, pdC*Fc*Tat represents Che pdC ejq)res8ion vector for Tat 
immunofusin; LTR-TAR-Kappa represents the eaqnression vector for Kappa 
3S light chain, in which the LTR-TAR regulatory region can be 

transactivated by Tat; and pCBP-Xat is an expression vector for Tat« 
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whose transcription is under the control of the human cytomegalovirus 
enhancer and promoter, pCEP-Tat was used as a positive concrol to 
monitor the transact i vat ion of the LTR-TAR-Kappa by Tat protein. As a 
negative control LTR-TAR-Kappa was transfected alone to demonstrate 
5 that it is not transactivated in the absence of Tac protein or Tat 
immunofusin. As shown in Table ^. high level expression of the Tat 
immunofusin was observed in transfection 1; high level expression of 
both Tat and Kappa light chain were observed in the cotrans feet ion 
experiment, transfection 2. Transact ivat ion of Kappa by Tat was seen 

10 in the positive control, transfection 3, as expected. Little or no 
expression of Kappa was seen in the negative control, transfection 4* 
also as expected. Therefore, the Kappa light chain is eaqjressed only 
through transactivation of the LTR-TAR region by a functional Tat 
protein, and the Tat immunofusin provides a functional Tat protein 

15 which is readily secreted from the host cell. This result also 
demonstrates that the secretion cassette is able to direct the 
secretion of a protein which is normally transported to the nucleus of 
the host cell. 

Table 2 

20 ELISA ( no/ml) 
DKTA used in transfection EC ^QBil 

1. ) pdC-Fc-Tat >3000 0 

2. ) pdC-Pc-Tat, LTR-TAR-Kappa 1600 l60 

3. ) PCEP-Tat. LTR-TAR-Kappa 0 277 
25 4 J LTR-TAR-Kappa 0 3 

pstattptA B. F«pregfilon of Rgv Immiinofusin 

A 350 base-pair cDNA fragment encoding Rev was modified to include 
30 as* Xmal site and a 3« Xhol site and then ligated 3* of the described 

secretion cassette in the pdC expression vector. The sequence of the 

cDNA encoding the Rev protein is provided in Ratner et al., 1985. 

Nature, 313:277, and is incorporated herein by reference. 

Specifically, the 5« end of the cDHA tms modified to C CC6 QgT CGC XTC 
35 GCA .... (Seq. ID No, 7) , where the underlined sequence is the Xmal 

site and the ATG in bold is the translation start codon of the Rev 
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gene. At the 3' end, an Xhol site was introduced immediately 
downstream of the translation stop codon by standard PCR techniques. 
High level expression was obtained in transiently transfected 293 cells 
and stably transfected NS/0 cells. Stable NS/0 clones produced about 3 
5 |ig/lO^ cells/day of the Rev immunofusin, which has a molecular weight 
of about 50 IcD when analyzed on a SDS-gel under reducing conditions. 

Bxaiqple 9. site-soecif ic Proteolytic «eftvi^qg Pf Pn 

0 

An exenplary cleavage of an immunofusin is described below, as 
.would be apparent to one of ordinary skill in the art, each of the 
above described immunofusins could be cleaved from their respective 
secretion cassettes using the same method or an analogous method. 

5 

A CD2€ immunofusin having a lysine residue ("FciLys) -CD26 
immunofusin"). introduced by linker adaptor during construction of the 
immunofusin between the Fc region and the CD26 target protein sequence 
was cleaved using trypsin. TO cleave the Fc<Lys)-CD26 immunofusin* the 

:0 immunofusin was bound on Protein A Sepharose and cleaved at the desired 
lysine position by ttypsin to release CD26 as follows: Fc(Lys)-CD26 
immunofusin bound on Protein A Sepharose was incubated with a 1% 
trypsin solution at 37«C for 2 hr* Trypsin inhibitor (Sigma) was then 
added to stop any further digesticm. The supernatant was then removed 

15 and analyzed on an SDS-gel under reducing conditions. After Ooomassie 
staining « a band having a molecular weight of 110 kD, which corresponds 
to the size of CD26 without the secretion cassette « was bbt:ained. Ab a 
controls CD26 immmofiisin^ without the lysine residue at the junction 
of the fusion between the Pc domain and the CD26 target protein <«Fc- 

10 0)26 immunofusin*) ♦ wais bound on Protein A Sepharose and similarly 
treated. The CD26 was found to not be released from the secretion 
cassette of the FC-CD26 immunofusin^ as was expected* and this also 
confirmed the specific cleavage of the immunofusin at the amino acid 
lysine which was inserted between the CH3 domain of the Fc region and 

\5 the target CD26 protein. As a further control, an identical aliquot of 
FC-CD26 immunofusin i^ch was bound to Protein A Sepharose was boiled 
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in the protein sample buffer and SDS-gel analysis of the supernatant 
showed a 140 kD band corresponding to the full length CD26 immunofusin 
protein monoiner. 

5 The results from the gel electrophoresis experiment were confirmed 

by DPPIV activity assays of the tryptic digests. Quantitative recovery 
of the DPPIV enzymatic activity was obtained in the supernatant when 
the Pc(Lys)-CD26 iirenunofusin bound to Protein A Sepharose was treated 
with trypsin. In the parallel experiment with Pc*CD26 immunofusin, 
10 there was no DPPIV activity in the si^)ematant^ because the CD26 
protein was not released from the Protein A Sepharose. 

Example 10. Eypression Qf OSF'2 i-T»nntfnp^vgiP 

15 OSF-2 is a 80-kD secretory protein that is involved in the 

ossification process. The sequence the DNA encoding 0SP*2 is provided 
in Takeshita et al,, 1993, Biochera. J. 294:271, and is incorporated 
herein by reference. The cDNA encoding the OSF-2 protein with its 
signal peptide was cloned into the expression vector pdC. NS/0 cells 

20 were used for stable transfection and 293 cells were used for transient 
expression; but in neither case was the OSF-2 protein detected. 

The OSF-2 cmA was then adapted to be expressed -as an immunofusin;. 
At the 3* end, the Xbal site at the translation stop codon was 
25 converted to an Xhol site by linker ligation. At the S< end the 
following linker-adaptor was used: 

5* CCGLfiST AAA AAC AAT CAT TAT OAC AA 

3« £A TTT TT6 TXA GXA AXA CXa nC TAG 

as provided in Seq. ID Nos. 8 and 9. Die nucleotides in bold encode 
30 the K-terminus of the mature OSF-2 protein, ending with Bglll cohesive 
ends. These Bglll cohesive ends were ligated to the Bglll-Xhol 
fragment of the OSF-2 CDHA. The Xmal cohesive ends at the S' end of 
the linker-adaptor (underlined) were ligated to the unique Xmal site in 
the immunofusin expression vector. 

35 
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High level expression was obtained in transiently transfected 293 
cells and stably transfected NS/0 cells. Stable NS/0 clones produced 
about S CO 7 ^g/Inl of a 110 kD protein, when analyzed on a SDS-gel 
under reducing conditions. This protein was confirmed to be the OSF-2 
S immunofusin by Western blotting with an anti*0SF-2 antibody. 

It was also found that the expression of OSF-2 as an immunofusin in 
a mammalian system was superior to the expression of OSF-2 in the 
thioredoxin gene fusion expression system in E. coli (LaVallie et al,^ 

10 1993, Biotechnology, 11:167] . The thioredoxin gene fusion system was 
designed to circumvent the formation of inclusion bodies because fusion 
to thioredoxin increases the solubility of many heterologous proteins 
produced in the E. coli cytoplasm. To test this system for the 
es^ression of OSF-2, the cDNA encoding the mature OSF-2 was inserted 

IS into the Smal site of the pTrxFus vector (Invitrogen, San Diego, CA) , 
thus creating a thioredoxin OSF-2 fusion protein. The supplier's 
protocol for the expression of the fusion proteins was followed. The 
. thioredoxin OSF-2 fusion protein was ea^ressed, and, as a control, the 
thioredoxin protein was eaqpressed alone without a fusion partner. The 

20 results showed that although thioredoxin alone could be produced as a 
soluble protein at a hig^ level « the thioredoxin OSF-2 fusion protein 
was present only in the insoluble fraction. Therefore, in addition to 
the lack of post-translational modification in bacterial eiq>ression, a 
relatively con^lex mammalian protein such as OSF-2 was not synthesized 

25 as a soluble protein i^en fused to thioredoxin. 

Bxaaple 11; Bxpresflion of BIQ-H3 inaninofusin 

PZG«K3, a gene product -vAiich is induced by transforming growth 
.30 factor^P. is a SB«kD secretory protein that shares sequence homology 
with OSF-2. The sequence of CDBA encoding PX6-H3 is provided in 
Skonier et al. (1992) USA and Cell Biology* ll:5li, and is incorporated 
herein by reference. The cONA encoding the native PI6-H3 %ras cloned 
into the expression vector pdC? but atteiiq;>ts to obtain stable 
35 transfectants producing pi6-K3 were tmsuccessful. 
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The PIG-H3 cDNA was then adapted to be expressed as an innnunofusin. 
At the 3' end. the BsmI site downstream of the translation stop codon 
was converted to an Xhol site by linker ligation. At the 5* end, the 
following linker -adaptor was used: 
5 5' CCG GGT AAA GCC CTG OGC C 

3 • CA nr CGC one 

{Seq ID, Nos. 10 and 11). The nucleotides in bold encode the N- 
terminus of the mature PIG-H3 protein. Ttie linker-adaptor had XmaX 
cohesive ends for ligating to the eJ^ression vector as described in the 
10 above exao^sles, and Ppal cohesive ends for ligating to the Apal site at 
the 5* end of the cDNA sequence encoding the mature pIG*H3. 

High level eiqiression was obtained in transiently transfected 293. 
cells and stably transfected US/0 cells. -Stable NS/0 clones produced 
IS about 3.5 H9/10^ cells/day of a 100 kD protein when analyzed on a SDS- 
gel under reducing conditions. This protein was confirmed to be the 
PIG-H3 immunofusin by Western blotting with anti-Pl6-K3 antibody. 

Exan^le 12. Expressio n of the soluble form of loE 
20 receptor as an immunofusin 

The high affinity IgE receptor alpha subunit (IgE-R) , the DNA 
sequence of which can be found in Kochan et al. (1988) Nucleic Acids 
Res. 16: 3584 and is incorporated herein by reference « was constructed 

25 as an. immunofusin as follows: An Xnal site was introduced to the 5* 
end of the cDNA encoding the mature ZgS-R so that the sequence at the 

junction of the fusion was C CCG GOT GTC CCT CMI (Seq, ID No. 12) . 

where the ICtaal site is underlined and the three codons in bold are the 
first three amino acid residues of the mature igE-R. At the 3* end of 

30 the IgE-R« the cOHA encoding the transmembrane domain and the rest of 
the C-terminus was deleted and a translation stop codon was placed 
after the last codon of the extracellular domain « The sequence of the 
IgB-R immunofusin at the 3* end was thus 1AC TGO CCA TAA CTC C3AG (Seq. 
ID No. 13) • i^ere the three codons in bold were the last three amino 

35 acid residues of the extracellular domain of the IgE-R, and they were 
followed by a stop codon and an XhoZ site (underlined) . 
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The pdC expression vector containing the IgE-R immunofusin was 
trans^ected into 293 cells and HS/0 cells. High levels of expression 
{3 to 5 |ig/ml) of the IgE-R inuminofusin were detected in the cell 
5 culture media by Bnti(Fc) ELISA. SDS-gel analysis under reducing 

conditions showed a band of the expected size of 70 kD. The partially 
purified protein (on Protein A Sepharose) was shown to bind IgE in an 
IgB-R/IgE ELISA. 

10 Example 13. Evnreasion of FCYl 

Fcfl was expressed by itself without a C- terminal target protein. 
This was achieved by ligating the following linker (having Xmal and 
Xhol cohesive ends) 
15 S< CCG GGT AAA TAB C 

3 • CA TTT ATC GAG CT 

(Seq. ID Nos. 14 and 15). to the Xmal and Xhol sites of the pdC to 
reconstruct the coding region of Fc. High levels of expression was 
detected by anti (Fc) EUSA in the cell culture media of the transiently 
20 transfected 293 cells (5 to 7 fig/ml) and stably transfected NS/O clones 

(5 to 10 |ig/ml) . SDS-gel analysis under reducing conditians showed an 

Fc band of the ejq>ected sire of 31 kD. 



25 



Example 14. p v pr<>flalon of PSMA iBBlMnPfUSin 



PSNA, prostate specific neotorane antigen, is a type II membrane 
protein having a noleeular weight of greater than lOO kD. PSMA is an 
integral mendbrane protein, and as such it is an attractive target for 
imaging and iinminoconjugate delivery. To facilitate the eaqpreaslon of 
30 significant quantities of PSMA. we subcloned the extracellular domain 
of PSMA (the soluble form) and ei^resaed this domain of PSMA as an 
immunofusin. A portion of the extracellular domain of PSMA. which is a 
soluble form of PSMA, can be produced as an immunofusin. 

35 The e5DNA encoding the full length PSMA was cloned from a human 

prostate carcinoma cell line LBCaP Ilsraeli et al. (1993) Cancer Res.. 
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53:227, which publication is incorporated herein by reference]. The 
portion of the PSMA cDNA corresponding to the extracellular domain was 
adapted to be expressed as an innnunofusin by Polymerase Chain Reaction 
using the following primers: 
5 N-terminal: 5' AAGCTT AAA TCC TCC AAT GAA GC 

C-terminal: 5* CTCGAG TTA GGC TAG TTC ACT CAA AG 
(Seq. ID Nos. 16 and 17) . The two primers provide the HindllX and the 
Xhol sites {underlined) for cloning into the immunofusin expression 
vector « In the N*terminal primer, the Hindlll site is followed by the 
10 coding sequence of the extracellular domain of PSMA (in bold) 

immediately after the transmembrane region. In the C-terminal primer « 
the Xhol site is followed by the anticodon of the STOP codon and the C- 
terminal coding sequence of PSMA (in bold) . The amino acid sequence of 
the extracellular domain of PSMA is shown in Seq. ID No. 18. 

15 

High level expression was obtained in stably transfected 290 and 
Sp2/0 cells. The PSMA immunofusin secreted into the cell culture media 
was purified hy Protein A Sepharose. Treatment of the immunofusin with 
the protease plasmin quantitatively converted the 130*)a) Pc-PSMA into 
20 two products: the lOO-kD PSMA extracellular domain and the 3l«*]cD Pc. 
The Fc was then removed from the solution by adsorption onto Protein A 
Sepharose; The soluble PSMA was purified and used to immunize mice. 
It is. expected that an antibody specific only to P34A should facilitate 
diagnosis and therapy of prostate cancer « 

25 

Example 15. EmrMflion of Murine Pc 

The Pc region of murine r2a was prepared for esqpression as an 
immunofiisin. Since the murine Fc region will not be immunogenic to 
30 mice* such an immunofusin containing the murine Fc followed by« for 
exainple« a human protein fusion partner can be used to immunise mice 
directly witihout prior cleavage to get rid of the Fc. The murine Pc 
was cloned into our immunofusin expression vector as described below« 
and was expressed at a hig^ level under our expression conditions • 

35 
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The murine Fc Y2a domain, preceded by the signal peptide described 
above, was cloned into an esqsression vector, pdC, and was expressed 
without fusion to a target protein. Murine Fc y2a cDNA (Sikorav et 
al., 1980, Nucleic Acids Res,, 8:3143-3155. which publication is 
5 incorporated herein by reference) was adapted for cloning into the 
expression vector by Polymerase Chain Reaction using the following 
primers : 

N-terminal: 5' CTTAAG C GAO CCC AOA GGG CCC ACA 
C-terminal: 5' aCGfiSC TCA TXT ACC CGG ACT CCG 

10 (Seq. ID Nos, 19 and 20) . THe N-terminal primer contains an Aflll site 
(underlined) for ligating to the Aflll site at the 3« end of the signal 
peptide, described above. Ttie sequence following the Aflll site (in 
bold) encodes the amino acid residues in the hinge region of murine y2a 
gene« The C- terminal primer contains an Xhol site for cloning into the 

15 expression vector, followed by the anticodons of the translation STOP 
codon and the carboxyl end of murine yi2a. (in bold) . 

High level ea^reasion of the murine Fcy2a region was demonstrated 
in 293 cells by SDS gel analysis followed by Western blotting with an 
20 anti*murine IgG antiboc^* 

Example 16. Emreflgion of qd120 

The envelope protein gpl20 of human imiminodef iciency virus (Hiv) is 
25 a glycoprotein having a molecular weight of l20lcD, and is esqpreseed on 
the surface of HIV particles and mv infected cells. The protein gpl20 
is originally expressed in infected cells as a polypratein,. gpl60, 
which is then cleaved by a cellular protein to gpX20 and gp4i. gpi20 
was prepared as an imnunofusin and determined that the gpI20 
30 immunofusin was ea^ressed at a very high level. Any desired portion of 
gpl20 may also be prepared as inmunofusiri. The Fc inoiety .of the gpl20 
imimmofusin could be cleaved off and gpl20 was purified. 

The cfbmplete nucleotide sequence of HIV has been published in 
35 Ratner et al. (1985) Nature, 313:277, and this publication is 

incorporated herein by reference. To prepare the gpl20 immunofusin, a 



PCTAJS9S/11720 



" -34. 

translation STOP codon followed by an Xhol restriction site was 
introduced to the gpl20-gp4l junction after amino acid Arg-518 of gpl60 
using standard molecular biology techniques, e.g., polymerase chain 
reaction. The existing Ndel restriction site present at nucleotide 
5 5979^ Which is within the amino terminal portion of gpl20, was 

converted to a Hindlll restriction site through linker-adaptor ligation 
to generate and in -frame fusion. The resultant Hindlll-Xhol fragment 
(1.36 kilobase pairs) encoding gpl20 was then cloned into the 
immunofusin expression vector, pdC, as described above. 

The gpl20 immunofusin expression vector was expressed in stably 
transfected 293 cells according to the methods described above, and 
high level expression of the. gpl20 immunofusin was obtained. The gpl20 
.immunofusin was functionally active, as determined by binding to C04 ia 
IS an BLISA. The gpl20 immunofusin was also determined to be 

quantitatively cleaved by enterokinase to release gpl20 and the Fc 
region. 

Other BfftoodiTOntg 

20 

The invention may be embodied in other specific forms without 
departing from the spirit or essential characteristics thereof. The 
present embodiments are therefore co be considered in all respects as 
illustrative and non*reetrictive« the scope of the invention being 
25 indicated by the appended claims rather than by the foregoing 

description, and all changes which come within the meaning and range of . 
equivalency of the claims are therefore intended to be embraced 
therein* 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

5 

(i) APPLICANT: 

<A) NAME: FOJI If'WUNOPHARMACEOTICALS CORP, 

(B) STREET: 125 HARTWELL AVENUE 

(C) CITY: LEXINGTON 
10 (D) STATE: MA 

<E) COONTRY: USA 

(F) POSTAL CODE: 02173 

(G) TELEPHONE: (617) 661-5300 

(H) TELEFAX: (€17) 861-5301 
15 (I) TELEX: 

(ii) TITLE OF INVENTIOI: EXPRESSION AND EXPORT TECHNOLOGY OF 
PROTEINS AS IMMONOFOSINS 

20 (ill) NOMBER OF SEQUENCES: 20 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PATENT ADMINISTRATOR, TESTA« HURWITZ & 

THIBEAULT 

2S (B) STREET: 125 HIGH STREET 

(C) CITY: BOSTON 
CD) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110 

30 

(V) COMPUTER READABLE FORM: 

<A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

3S (D) SOFTHARE: Patentin Release #1.0. Version «1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 
40 (C) CLASSIFICATICMa: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: PITCHER, EDMUND R. 

(B) RB6ISTRATI0N NOIGBR: 27,629 

45 (C) RBPfiRENCB/DOCKST NOMBBRs FlP-OOl 

(ix) TELECCMMUNICAXZON IHFORMATiaN: 

(A) TBLSPBONB: 617-248*7000 

(B) TELEFAX: 617-246-7X00 

50 

(2) INFORMATION FOR SEQ ID 110:1: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

60 (ii) MOLECULE TYPE: tiOm 

(Xi) SEQUENCE DESCRIPTION: SEQ XD N0:1: 

65 
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GAGAATTCTT AAGCGAGCCC AAATCTTCTG ACAAAACTCA C 
41 

<2) INFORMATION FOR SEQ ID NO: 2: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: Single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



15 



20 



30 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

CCGGGTAAAG GCACAGATGA TGCTACAG 
28 

(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 24 base pairs 
25 (B> TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DBSCUPTION: SBQ ID 110:3: 



35 CTGTAGCATC ATCTXmmT TTAC 
24 

(2) INFORMATIOil FOR SEQ ID N0:4: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2B base pairs 
■ (B) TYPE: nucleic acid 

(C) STRANDEONESS: Single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Cl»IA 



SO (xi) SEQUBNCX OBSCRIPTION: SEQ ID H0:4: 

CCGGGTTCAG GGGATGAOGA TQAO^XA 
28 

55 (2) INFORMATI<» FOR SBQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 
60 (C) STRANDEDNBSS: Single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: CDNA 



65 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

AGCTTATCGT CATCGTCATC CCCTGAAC 
5 28 

(2) INFORMATION FOR SEQ ID NO: 6: 

(1) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEmESS: Single 

(D) TOPOLOGY: linear 

IS (ii) MOLECULE TYPE: cDNA 



20 



25 



35 



40 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

CCCGGGTC6C ATGGAG 
16 

(2) INFORMATION FOR SEQ ID NO: 7: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CCCGGOrCGC tOGGCA 
16 

(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDEiniESS: Single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



55 CCGGGTAAAA ACAATCATXA TGACAA 
26 

(2) INFORMATION FOR SEQ ZD N0s9: 

60 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 
(b) TOPOLOGY: linear 

65 
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(ii) MOLECULE TYPE: cDNA 



5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GATCTTGTCA TAATCATTGT TTTTAC 
26 

10 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
15 (C) STRANDSDNES5: single 

(D) TOPOLOGY: linear 



20 



40 



45 



55 



60 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE OESCRIPTI(»Y: SEQ ID NO: 10: 



CCGGGTAAAG CCCTGGGCC 

25 19 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOFOXOGY: linear 

35 (ii) MOLECOLE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 

CAGGGCTTTA C 
11 

(2) INFCmMATION FOR SBQ ID NO: 12: 



(i) SEQUENCE CHARACTERISTICS: 

(A) X^NGTK: 16 base pairs 

(B) TYPE: nucleic acid 

(C) S7RANDSDNBSS: Single 
50 (D) TOPOLOGY: linear 



(ii) MOLBCDLE TYPE: cWh 



(xi) SEOOENCE DBSCRIPTZON: SEQ ID N0sl2: 

CCCGGGrTGTC CCTCAG 
16 

(2) INFORMATICS FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGIB: 18 base pairs 
65 (B) TYPE: nucleic acid 
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(C) STRANDEDKESS : sir.gle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

5 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:13: 

10 TACTGGCTAT AACTCGAG 
18 

(2) IKFORMATION FOR SEQ ID NO: 14: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base paira 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 

20 

<ii) MOLECULE TYPE: cDNA 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:14': 

CCGGGTAAAT AGC 
13 

30 (2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CKARACT6RISTICS: 

(A) LENGIH: 13 base pairs 

(B) TYPE: nucleic acid 
35 (C) STRANDSrassS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

40 

(xi) SBQUENCB DESCRIPnON: SEQ ID N0:15s 

TCGAGCTAIT TAC 
45 13 

(2) INFC»^T;C^ FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 base piatirfi 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOJY: linear 

55 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

60 

AAGCTTAAIfir CCTCCAIVrGA AGC 
23 

(2) INFORMATION FOR SEQ ID N0:17: 
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10 



15 



25 



35 



50 



.40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



CTCGAGTTAG GCTACTTCAC TCAAAG 
26 

<2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 amino acids 
20 (B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1..707 

30 (D) OTHER INFORMATION: /note« "EXTRACELLULAR DOMAIN OF 

PSMA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Lys Ser Ser Asn Glu Ala Thr Asn lie Thr Pro Lys His Asn Met Lys 
15 10 15 



Ala Phe Leu Asp Glu Leu Lys Ala Glu Asn lie Lys Lys Phe Leu Tyr 
40 20 25 30 

Asn Phe Thr Gin He Pro His Leu Ala Gly Thr Glu Gin Asn Phe Gin 
35 40 45 

45 lisu Ala Lys Gin He Gin Ser Gin Trp Lys Glu Phe Gly Leu Asp Ser 

50 55 €0 



Val Glu Leu Ala His Tyr A^ Val Leu Leu Ser Tyr Pro Asn Lys Thr 
€5 70 75 80 

His Pro Asn Tyr He Ser He He Asn Glu Asp Gly Asn Glu He Phe 
65 90 95 



Asn Thr Ser Leu Phe Glu Pro Pro Pro Pro Gly Tyr Glu Asn Val Ser 
5S 100 105 • 110 

Asp He Val Pro Pro Phe Ser Ala Phe Ser Pro Gin Gly Met Pro Glu 
115 120 125 

60 Gly Asp Leu Val Tyr val Asn Tyr Ala Arg Thr Glu Asp Phe Phe Lys. 

130 135 140 



65 



Leu Glu Arg Asp Met Lys He Asn C/s Ser Gly Lys He Val He Ala 
145 150 155 160 
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Arg Tyrr Gly Lys Val Phe Arg Gly Asn Lys Val Lys Asn Ala Gin Leu 
165 170 a75 

Ala Glv Ala Lys Gly Val lie Leu Tyr Ser Asp Pro Ala Asp Tyr Phe 
5 180 185 150 

Ala Pro Gly Val Lys Ser Tyr Pro Asp Gly Trp Asn Leu Pro Gly Gly 
195 200 205 

10 Gly Val Gin Arg Gly Asn lie Leu Asn Leu Asn Gly Ala Gly Asp Pro 

210 215 220 

Leu Thr Pro Gly Tyr Pro Ala Asn Glu Tyr Ala Tyr Arg Arg Gly He 
225 230 235 240 

Ala Glu Ala Val Gly Leu Pro Ser He Pro Val His Pro lie Gly Tyr 
245 250 255 

lyr Asp Ala Gin Lys Leu Leu Glu Lys Met Gly Gly Ser Ala Pro Pro 
20 260 265 270 

Asp Ser Ser Trp Arg Gly Ser Leu Lys Val Pro Tyr Asn Val Gly Pro 
275 280 285 

25 Gly Phe Thr Gly Asn Phe Ser Thr Gin Lys Val Lys Met His He His 

290 295 300 

Ser Thr Asn Glu Val Thr Arg He Tyr Asn Val He Gly Thr Leu Arg 
305 * 310 315 320 

Gly Ala Val Glu Pro Asp Arg Tyr Val He Leu Gly Gly His Arg Asp 
325 330 335 

Ser Trp Val Phe Gly Gly He Asp Pro Gin Ser Gly Ala Ala Val Val 
35 340 345 350 

His Glu He Val Arg Ser Phe Gly Thr Leu Lys Lys Glu Gly Trp Arg 
355 360 365 

40 Pro Arg Arg Thr He Leu Phe Ala Ser Tjrp Asp Ala Glu Glu Phe Gly 

370 375 380 

Leu Leu Gly Ser Thr Glu Trp Ala Glu Glu Asn Ser Arg Leu Leu Gin 
385 390 395 400 

Glu Arg Gly Val Ala tyr He Asn Ala Asp Ser Ser He Glu Gly Asn 

Tvr Thr Leu Arg Val Asp Cys «ir Pro Leu Met Tyr Ser Leu Val His 
SO 420 425 430 

Asn Leu Thr Lys Glu Leu Lys Ser Pro Asp Glu Gly Phe Glu Gly Lys 
435 440 445 

55 Ser Leu Tyr Glu Ser Trp Thr Lys hys Ser Pro Ser Pro Glu Phe Ser 

450 455 460 

Gly Met Pro Arg He Ser Lys Leu Gly Ser Gly Asn Asp Phe Glu Val 
465 470 475 480 



30 



45 



60 



Phe Phe Gin Arg Leu Gly He Ala Ser Gly Arg Ala Arg Tyr Thr Lys 
4BS 490 495 



Asn Trp Glu Thr Asn Lys Phe Ser Gly Tyr Pro Leu lyr His Ser Val 
65 500 . 505 510 
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Tyr Glu Thr Tyr Glu Leu Val Glu Lys Phe Tyr Asp Pro Met Phe Lys 

515 520 525 

Tyr His Leu Thr Val Ala Gin Val Arg Gly Gly Met Val Phe Glu Leu 
530 535 540 

Ala Asn Ser He. Val Leu Pro Phe Asp Cys Arg Asp Tyr Ala Val Val 
545 550 555 560 

Leu Arg Lys Tyr Ala Asp Lys He Tyr Ser He Ser Met Lys His Pro 
565 570 575 



Gin Glu Met Lys Thr Tyr Ser Val Ser Phe Asp Ser Leu Phe Ser Ala 
15 580 585 590 

Val Lys Asn Phe Thr Glu He Ala Ser Lys Phe Ser Glu Arg Leu Gin 
595 600 €05 

20 Asp Phe Asp Lys Ser Asn Pro He Val Leu Arg Met Met Asn Asp Gin 

610 615 620 



Leu Met Phe Leu Glu Arg Ala Phe He Asp Pro Leu Gly Leu Pro Asp 

625 . 630 635 640 

Arg Pro Phe Tyr Arg His Val He Tyr Ala Pro Ser Ser His Asn Lys 

^ 645 650 655 



Tyr Ala Gly Glu Ser Phe Pro Gly He Tyr Asp Ala Leu Phe Asp He 
30 660 665 670 

Glu Ser Lys Val Asp Pro Ser Lys Ala Trp Gly Glu Val Lys Arg Gin 
675 €60 €65 

35 lie Tyr Val Ala Ala Phe Thr Val Gin Ala Ala Ala Glu Thr Leu Ser 

690 695 700 



Glu Val Ala 
705 

<2) IHFdRMATIOtl FOR SBQ ID NO: 19: 



(i) 8BQaEE;[CE CHARACTERISTK^: 
(A) LEHGTH: 25 base pairs 
45 (B) TYPE: nucleic acid 

(C) STRANDSDNBSSs single 

(D) TOPOLOGY: linear 



(ii) MOLECOLB T7PB: cDSA 



(xi) SEQOENCE DESCRIPTION: SBQ ID 110:19: 



55 CTEAAGC6A6 CCCAGAGGGC GCACA 
25 

(2) niFORMAnON FOR SBQ ID N0:20: 

60 li> SEOOENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRAKDEDNBSS: single 

(D) TOPOLOGY: linear 

65 
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(ii) MOLECULE TYPE: CONA 



(xi) SEODENCE DESCRIPTION: SEQ ID NO: 20: 

CTCGAGCTCA TTTACCCGGA GTCCG 
25 



10 
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What is claimed is: 

1 1. A'DNA produced by recombinant DNA techniques for inducing expression 

2 and subsequent secretion of a target protein, said sequence being free of 

3 immunoglobulin CHI and conprising a polynucleotide encoding* from its 5' to 

4 3* direction: 

5 A) a secretion cassette which comprises 

6 a signal sequence; 

7 an immunoglobulin Fc region; and 

8 B) a target protein sequence. 

1 2. The DNA of claim I wherein the target protein sequence encodes a 

2 soluble form of prostate specific membrane antigen. 

1 3. The DMA of claim 1 wherein the target protein sequence encodes at 

2 least a portion of gpl20 protein. 

1 4. l!he DKA of claims 1, 2 or 3 wherein the signal sequence encodes a 

2 signal peptide imich directs secretion of the target protein and is then 

3 removed by enzymatic cleavage. 

1 5. The DHA of claim 1« 2 or 3 wherein the Fc region is altered to delete 

2 at least one effector function activity. .r 

1 €• The DNA of claim 1, 2 or 3 wherein the Fc region con^rises a hinge « a 

2 CH2 domain and a CH3 domain of immunoglobulin gamma* 

1 . 7. The DNA of claim 1« 2 or 3 wherein the Fc region coniprises a hinge 

2 region and a CH3 domain of immunoglobulin gamma. 

1 6« The DMA of claim 1« 2 or 3 further con9>ri8ing a proteolytic cleavage 

2 site interposed 3' of a portion of said polynucleotide encoding said 

3 immunoglobulin Fc region and 5* of a portion of said polynucleotide encoding 

4 said entire target protein. 

1.9. A replicable esqiression vector for transfecting a mammalian cell« said 

2 vector con^rising the DHA of claim 1« 2 or 3. 
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1 10. A host cell transformed with the DNA of claim 1, 2 or 3 . 

I 11, Target protein produced by culturing the cell of claim 10. 

1 12. A method of producing a target protein conprising the steps of: 

2 1) transfecting the DHA of claim 1, 2 or 3 into a host cell; 

3 2) culturing the host cell in a medium under conditions to promote 

4 expression and secretion of a fusion protein con?)rising 

5 an immunoglobulin Fc region « and 

6 a target protein; and 

7 3) collecting the fusion protein from the medium. 



1 13. The method of claim 12 wherein the fusion protein has bioactivity of 

2 the target protein. 

1 14. The method of claim 12 further comprising the additional steps, after 

2 step 3, of 



3 4) cleaving the Fc region from the target protein, and 

4 5) collecting the target protein. 

1 15. A method of producing a target protein comprising the steps of: 

2 1) transfecting the DNA of claim 1« 2 or 3 into a host cell; 

3 2) culturing the host cell In a medium under conditions to promote 

4 esqsression and secretion of a fusion protein comprising 

5 an immunoglobulin Fc region, and 

6 a target proteim and 

7 3) cleaving the Fc region tram the target protein, and 

8 4) collecting the target protein from the medium. 
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